Part-Guided Attention Learning for Vehicle Instance Retrieval
نویسندگان
چکیده
Vehicle instance retrieval (IR) often requires one to recognize the fine-grained visual differences between vehicles. Besides holistic appearance of vehicles which is easily affected by viewpoint variation and distortion, vehicle parts also provide crucial cues differentiate near-identical Motivated these observations, we introduce a Part-Guided Attention Network (PGAN) pinpoint prominent part regions effectively combine global local information for discriminative feature learning. PGAN first detects locations different components salient regardless identity, serves as bottom-up attention narrow down possible searching regions. To estimate importance detected parts, propose Part Module (PAM) adaptively locate most with high-attention weights suppress distraction irrelevant relatively low weights. The PAM guided identification loss therefore provides top-down that enables attention be calculated at level car other Finally, aggregate features together improve performance further. combines part-guided bottom-up top-down attention, in an end-to-end framework. Extensive experiments demonstrate proposed method achieves new state-of-the-art IR on four large-scale benchmark datasets.1
منابع مشابه
Multiple-Instance Learning for Music Information Retrieval
•Multiple-instance learning (MIL) algorithms train classifiers from lightly supervised data – collections of instances, called bags, are labeled rather than the instances themselves – algorithms can classify bags or instances, we focus on instances •Motivation for applying MIL to MIR: – propagate metadata between granularities: artist, album, track, 10-second clip – e.g. train clip classifiers ...
متن کاملAttention-based Deep Multiple Instance Learning
Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the bag label probability is fully parameterized by neural networks. Furthermore, we propose a neural network-based permutation-invariant aggregation operator tha...
متن کاملAttention Guided Deep Imitation Learning
When a learning agent attempts to imitate human visuomotor behaviors, it may benefit from knowing the human demonstrator’s visual attention. Such information could clarify the goal of the demonstrator, i.e., the object being attended is the most likely target of the current action. Hence it could help the agent better infer and learn the demonstrator’s underlying state representation for decisi...
متن کاملLearning Robust Hash Codes for Multiple Instance Image Retrieval
In this paper, for the first time, we introduce a multiple instance (MI) deep hashing technique for learning discriminative hash codes with weak bag-level supervision suited for large-scale retrieval. We learn such hash codes by aggregating deeply learnt hierarchical representations across bag members through a dedicated MI pool layer. For better trainability and retrieval quality, we propose a...
متن کاملOptimization Strategies for Instance Retrieval
In this paper new techniques for optimizing instance retrieval in DL systems are described. The algorithms are evaluated with application examples from a natural language processing application.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Intelligent Transportation Systems
سال: 2022
ISSN: ['1558-0016', '1524-9050']
DOI: https://doi.org/10.1109/tits.2020.3030301